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Abstract: We propose a new method to resolve combinatorial ambiguities in hadron 
colHder events involving two invisible particles in the final state. This method is based 
on the kinematic variable Mt2 and on the M7^2-assisted-on-shell reconstruction of invisible 
momenta, that are reformulated as 'test' variables Tj of the correct combination against the 
incorrect ones. We show how the efficiency of the single Tj in providing the correct answer 
can be systematically improved by combining the different Tj and/or by introducing cuts 
on suitable, combination-insensitive kinematic variables. We illustrate our whole approach 
in the specific example of top anti-top production, followed by a leptonic decay of the W 
on both sides. However, by construction, our method is also directly applicable to many 
topologies of interest for new physics, in particular events producing a pair of undetected 
particles, that are potential dark-matter candidates. We finally emphasize that our method 
is apt to several generalizations, that we outline in the last sections of the paper. 
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1 Introduction 

Many models providing WIMP dark-matter candidates predict events at the Large Hadron 
Collider (LHC), characterized by a pair of invisible particles in the final state. This very 
same topology is also available in the Standard Model (SM), e.g. in dileptonic decays 
of a top-quark pair: ti — )• M'^uM~i>. Several kinematic methods have been proposed to 
determine the particle masses in such missing energy events [1]. On the other hand, to 
access information beyond the mass, e.g. the spin and/or chirality of parent particles, it is 
very often highly desirable, sometimes necessary, to reconstruct the parent particles' full 
momenta event by event. Such reconstruction generically suffers from the combinatorial 
ambiguity of correctly assigning visible particles to the given event topology. For instance, 
there are two possibilities to pair the two 6-jets and the two charged leptons in the dileptonic 
decay of a top quark pair, and three possibilities to group the jets into two dijets in 
the gluino pair decay gg — t- jjXiJJXi supersymmetric models, Xi denoting the LSP 
neutralino. Not to mention that the possible combinations proliferate when the event 
involves multi-step cascade decays and one wishes to identify the order of visible particles 
in each decay chain, besides the assignment to the correct decay chain. For instance, in 
the event X2X2 i^i^i^i"^ ^^^'Xi^'^^Xi, one has in total 2x2x2 = 8 possibilities 
to assign the charged leptons to the event topology (assuming they have all the same 
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flavor), and even more to correctly assign jets in the cascade decay gg jqjq — )■ jjXiJjXi 
(3x2x2 = 12). 

Quite generally, it is clear that minimizing combinatorial uncertainties is one of the 
first steps toward an accurate reconstruction of the full parent particles' momenta in miss- 
ing energy events.^ Actually, this problem is already relevant within the SM, with no need 
to invoke new-physics events. A well-known example is that of dileptonic tt events in the 
SM, where the reconstruction of the t and t momenta is required in order to study spin cor- 
relations in tt [3-6] , which in turn allow to probe the quantum structure of the decay much 
more thoroughly than the information from the cross section alone does. Broadly speaking, 
strategies aimed at fully reconstructing the momenta of parent particles in missing-energy 
events are of obvious relevance when it comes to making the most out of the LHC data. 

In this paper, we aim to develop a model-independent way of reducing combinatorial 
uncertainties for missing energy events at the LHC. We propose a new method based on the 
combined use of the kinematic variable Mt2 [7, 8] and of the MT2-assisted-on-shell (MAOS) 
reconstruction of missing momenta [9]. All these kinematic variables are reformulated as 
'test' variables Tj (testing the correct pairing of final-state visible particles against the 
incorrect ones) and the information from the various Tj is then used combinedly, in order 
to improve over the performance of each Tj separately. 

We apply this strategy to the dileptonic tt decay, that, as mentioned, represents a 
prototype example of pair production of a parent particle of interest decaying into partly- 
invisible daughters. Besides its intrinsic interest, the decay offers a popular event topol- 
ogy in SM extensions with a dark-matter candidate, rendered stable by a conserved Z2-like 
symmetry. Applying our method to dileptonic ti, we find an efficiency of determining the 
correct partition in the ballpark of 90%, and that this efficiency can be systematically 
increased by introducing cuts on certain partition-insensitive kinematic variables, at the 
price of an actually moderate loss in event statistics. 

The organization of our paper is best summarized in the table of content. In sec. 2 
we start with an example of the problem at hand, and then introduce our test variables Tj 
in terms of and of the MAOS algorithm, applied to the full tt decay or to its WW 
subsystem. We here examine the efficiency of each variable Tj in the context of a parton- 
level analysis. In sec. 3, we address the issue of improving over the efficiency of each single 
Ti, by devising a combined test, and introducing the above mentioned cut variables. In 
section 4, we discuss some generalizations of our method, in particular its application to 
generic missing energy events producing a pair of dark matter particles in the final state. 
Finally, in sec. 5 we conclude, providing an outlook of future work. 

""^For example, see the method recently proposed in ref. [2]. 
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2 Test variables to choose among partitions of the visible final states 



2.1 An example 

We will state our problem of interest in the example of tt production followed by fully 
leptonic W decays. The decay chain is namely 

bW+ hW- hi+v bi-j? . (2.1) 



tt 

At the level of reconstructed objects, this 
process is triggered on by, e.g., requiring 
at least two jets (two of which possibly b- 
tagged), two leptons and missing energy in 
the final state. Clearly, the triggered-on 
objects can be partitioned in two different 
ways, namely 
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— right 
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where the index labels the decay chain. In 
the rest of the text we will refer to the dif- 
ferent ways of grouping visible final states 
into two sets as partitions. 

Our aim is to construct test variables 
able to distinguish the right from the wrong 
final-state partitions. To this end, one 
may exploit suitable event variables, that 
namely display certain predictable features 
as a function of the event kinematics - like 
edges, thresholds or peaks. The main ob- 
servation to take advantage of is that these 

features are present, and calculable from the measured final states, provided the partition 
of the visible particles is the right one, whereas they are partly or completely destroyed if 
the final-state partition is incorrect. 

The arguably simplest and best-known example to illustrate such ideas is the invariant 
mass of the visible particles, my^, belonging to a given decay chain i. In the case of the 
event topology (2.1), the visible particles of each decay chain are a 6-quark and a charged 
lepton. The corresponding invariant mass distribution enjoys an upper endpoint given by 
the formula 

1 , o 
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Figure 1. The niM distribution for the tt 
event topology (2.1) with right (solid black) and 
wrong (red dashed) partitions of the b quarks and 
charged leptons. The event generation details for 
this and the rest of our plots are described in 
sec. 2.2. 
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This endpoint feature is clearly displayed by the black solid histogram in fig. 1, representing 
the nihi distribution when correctly partitioning the final states. The same feature does 



-3- 



not need to hold in case rriM is calculated with the incorrect final-state partition: the 
corresponding histogram is also shown as a red dashed curve. Indicating the two possible 
partitions as Pi and P2, one can then construct the difference between the m;,^ values 
calculated with either partition, namely 

AT{P2, Pi) = T{P2) - T{Pi) , (2.3) 

where 

T{Pi) = muiPi) (2.4) 

is the test variable for a given partition Pj. From our previous considerations, one can 
expect that, if Pi corresponds to the right partition, P/j, and P2 to the wrong one, Pw, 
then Ar(Pi4/, Pr) is expected to be larger than zero, at least for a subset of the possible 
kinematic configurations of the visible final states. 

We therefore introduce the following criterion for using Ar(P2, Pi) as a variable testing 
the correct final-state partition in the case of dileptonic ti decay: 

Given a final state of 2b-jets +21, they can he partitioned in two possible ways, to he called 
Pi and P2. 7/ Ar(P2, Pi), defined as eq. (2.3), is found to he > (< 0), then Pi (P2) may 
he identified as the correct partition, with a calculable probability p of misidentification. 

In the sections to follow we will introduce several such test variables T{P), and for 
each of them estimate the efficiency 1 — p. We will also see that this efficiency can be 
systematically improved in two directions: 

(i) by introducing further kinematic cuts on the event sample, at the price of a reduced 
statistics; 

(ii) by combining the information from several r(P) in a global test variable, for ex- 
ample, a likelihood test with the AT(P2, Pi) distributions interpreted as probability 
distributions. 

Both of the above points will be addressed in the sections starting from sec. 3. All the 
numerical results of our paper rely on event simulations, whose details are introduced in 
the next section 2.2. In the rest of sec. 2 we will introduce all the test variables and discuss 
variations around their definitions as well as their efficiencies. 

2.2 Event generation and selection 

We generated 100,000 parton-level events of top quark pair production followed by dilep- 
tonic final states using MadGraph 5, with proton-proton collisions at ^/s = 7 TeV [10]. 
We here do not assume any contribution from physics beyond the SM. The SM predicts 
a^t ~ 162 pb at next-to- next-to-leading-order [11].^ Hence the generated events correspond 
to ~ 13 fb~^. The dilepton mode gives rise to final states with two 6-tagged jets, two lep- 
tons, missing transverse energy. We consider the leptonic decays of the W bosons to an 
electron or muon.^ 

^The cross section was calculated with rrit — 173 GeV and CTEQ6.6 parton distributions [12]. 
^There are small contributions from the leptonic decays of the tau, for example, — >■ fi^Uf^Ur, but here 
we neglect them for the sake of discussion. 
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We apply event selection cuts on the simulated event sample following the 0.7 fb ^ 
ATLAS dileptonic top analysis [13]. We namely require 

• exactly two oppositely-charged leptons (ee, /x^u, e^) with pT > 25 GeV and |r/| < 2.5, 

• at least two jets with px > 25 GeV and \r]\ < 2.5, 

• dilepton invariant mass, rriu > 15 GeV, 

• that events in the same-flavor lepton channels must satisfy the missing transverse 
energy condition > 60 GeV and |m« — mz\ > 10 GeV, 

• that events in the different-flavor channel satisfy Ht > 130 GeV. The event variable 
Ht is defined as the scalar sum of the transverse momenta of the two leptons and all 
selected jets. In analogy with [13] we instead do not require ^Jt or mu cuts in this 
channel. 

We found that about 37% of the total simulated events pass these basic selection cuts. The 
motivation for including the mentioned cuts is to have an event sample that resembles as 
much as possible the actual experimental event samples in dileptonic tt production. As 
such, it will be our reference event sample throughout our analysis. We leave aside for the 
moment the important issues of: inclusion of hadronization, QCD radiation, and modeling 
of detector effects, that we will address in a forthcoming, more in-depth study [14]. 

2.3 Test variables: definition and discussion 

2.3.1 Invariant mass of visible final states: Ti variable 

Our first test variable is constructed from 
rufji, mentioned in the example of sec. 2.1. 



In particular, it could be defined directly 
as in eq. (2.4). However, for each event one 
obtains two rribi values, m[^^ (i = 1, 2), cor- 
responding to the two decay chains. Com- 
bining the two m^^ values in different ways 
gives rises to alternative definitions of the 
test variable, not all of which yield the 
same efficiency. We found the combination 
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We correspondingly define our first test 
variable as follows: 



max 
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iP^) , (2.6) 
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Figure 2. Distribution for the variable 
ATi{Pw, Pr) (see defs. in eqs. (2.6) and (2.3)) 
with Pyi/ and Pr denoting the wrong and the 
right partition respectively. The legend in the 
plot indicates the percentage of correctly parti- 
tioned events, before and after inclusion of the cut 
M^*(0) > 400 GeV, to be described in sec. 3.2. 



for a given partition Pj. The ATi{P]v, Pr) 
distribution is shown in fig. 2. 

The efficiency figure in eq. (2.5), as 
well as in the solid histogram in fig. 2, refers 

to the events that pass the basic selection cuts described in sec. 2.2, and actually includes 
one further consideration: if one of the partitions results in mbi > mf^^^, then the right 
partition can be clearly selected because mhi{Pji) < m^*^^. 

2.3.2 Mt2 of the whole decay chain: T2 variable 

Another event variable that may be used to construct a test variable of correct partitioning 
is Mt2 , the so-called "stransverse mass" , which is especially suited to topologies with pair- 
produced particles, each decaying into partly invisible final states [7, 8]. The general event 
topology that M'r2 is suited for is the following: 

YY ^ V,{pi)x{ki) + V2{p2Mk2) , (2.7) 

where each of the Vi denotes a (set of) visible state(s), whose total four-momentum is, in 
principle, entirely reconstructible, and Xj X denote invisible particles with identical mass. 
The decay mode (2.1) is an example of this very event topology: Vi should be identified 
with a b i pair (for each decay chain) and X with the neutrinos. 
For the event topology (2.7), the Mt2 variable is defined as 

MT2{'m^) = min max |m^^^ ( Pit> ^it, pI, , M^^ ( P2t> ^2T, pI, rhyA \ , 

(2.8) 

where My*"* is the transverse mass [15, 16] of the decay chain i (= 1 or 2), namely 

(m«)' =pJ + ml + 2 (^^|pj^|2 +p2^|kjy|2 + ^2 _ p.^ . k.^^ . (2.9) 

A few comments may help get an intuitive picture of the above definitions. First, it should 
be kept in mind that what is measured in the event topology of (2.7) are the visible 
particles' momenta, pi, and the total momentum imbalance in the transverse direction, 
jSj,. The definition in eq. (2.8) correspondingly marginalizes over the transverse momenta 
ki,2T of Xi X) that are not separately measured, by taking the minimum over all the trial 
momentum configurations (denoted with a tilde), whose sum equals the measured jS^. 

Second, the invisible mass is, likewise, unmeasured, hence Mt2 is a function of 
a trial value for this mass, m^. In the case of the decay (2.1) my. is of course known: 

= nil, ~ 0. However, the topology in eq. (2.7) applies also to several new-physics 
scenarios, and in this case the physical mass of the pair-produced x is in general 
unknown. An interesting feature of Mt2 is the kink structure of M^2^{m^) at rhy. = m^, 
provided the pj value spans a certain range in each decay chain [17, 18]. An alternative 
possibility for the kink structure of M^2^{Th^) to appear is to have the YY system boosted 
by a sizable amount of upstream transverse momentum [19, 20]. The M']r2-kink method 
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LHC, /s = 7 TeV 



LHC, \fs = 7TeV 




Figure 3. The distributions of T2 = (left panel) and AT2(Pw, Pr) (right panel), with Pw 

and Pr the wrong and right partitions respectively. The legend in the second panel indicates the 
percentage of correctly partitioned events, before and after inclusion of the cut M"(0) > 400 GeV, 
to be described in sec. 3.2. 

makes it possible to measure rriy. and my simultaneously, and even if the decay chain is 
not long enough to constrain all the unknowns in the event. 

The property of Mt2 most relevant to our discussion is that, when the input trial mass 
rh^ equals the true mass value , the upper endpoint of the Mt2 distribution corresponds 
to the parent particle mass my. We want to use this property to distinguish the correct 
against the incorrect partition of the final-state visible particles. We note explicitly that, 
in dileptonic ti, one can construct two kinds of Mt2- system and M/]^^ 

for the WW subsystem [21-24], the visible particles being respectively b£'^b£~ and i~^£~. 
Since only one partition is possible in M^^, this variable cannot be used directly as a 
test variable. It can however be used to construct other test variables, to be introduced in 
the next section. In the rest of the present discussion we will therefore specialize to M,f*2- 

As already mentioned, the M^^j distribution is bounded from above by the top quark 
mass mt if the partition is the right one, whereas it is not if the partition is the wrong one. 
This fact is shown in the left panel of fig. 3. One can therefore define the variable 

T2{Pi) = M^'^iPi) , (2.10) 

where we recall that Pi (i = 1, 2) denotes the two possible partitions of the 26 + 2i visible 
final states. 

In the right panel of fig. 3, we show the distribution of AT2{Pw, Pr) with the right 
(Pr) and wrong (Pw) partitions. The figure shows that the relation AT2{Pw, Pr) > 
holds for about 80% of the events that passed just the basic selection cuts described in sec. 
2.2, and that this percentage reaches as much as 97% after the M"(0) cut to be described 
in sec. 3.2. 
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To conclude this section, we mention that the Mt2 method of finding the right partition 
has been proven to be useful for mass measurements and/or event reconstruction in refs. [20, 
25-35]. 

2.3.3 MAOS-reconstructed rrit and mw mass: T3^4 variables 

A further set of variables to test the correct partition can be constructed from the obser- 
vation that the sum of final-state momenta in either decay chain must reconstruct, up to 
width effects, the parent-particle's invariant mass. The main obstacle to this, in the case 
of the event topology (2.7), is the fact that the invisible momenta of the two decay chains, 
ki and ^2, are not measured separately, but only their sum is, and only in the transverse 
plane. 

Systematic techniques exist however, enabling to obtain a best guess of the invisible mo- 
menta ki and k2.^ Among these techniques is the so-called M7^2-assisted on-shell (MAOS) 
reconstruction of invisible momenta [9]. The only assumption of the MAOS method is that 
the event topology should be of the kind of eq. (2.7). Then the transverse components of 
the invisible momenta, kir, k2T) can be estimated event by event to be the location of 
the minimum of the Mt2 variable constructed for the event. Recall in fact that, from eq. 
(2.8), Mj'2 is obtained from a minimization over all possible ki^2T configurations subject 
to the constraint kiy + k2T = i>T- 

Once kir and k2T are estimated by Mt2 , the longitudinal components of the invisible 
momenta may be determined from the on-shell relations 

(k + C'"')' = my , (km'' = ml (2.11) 

where, as usual, i labels the decay chain, my the parent particle mass, and the invisible 
particle mass. Eqs. (2.11) amount to a quadratic equation in the longitudinal components 
of ki, hence one has in general two solutions for each decay chain, k^^. Modulo this two- 
fold ambiguity, and assuming my, to be known (in our case it is m^ ~ 0), the MAOS 
reconstruction thus leads to a estimate of the full invisible momenta fcj. Specifically, kf^^"'^ 
has a roughly Gaussian distribution around kl'^^^ and a moderate spread, that can be 
systematically improved by requiring event cuts [9]. 

In our case, the whole argument in the previous paragraph can be applied to the full tt 
decay, or to the WW subsystem, because both share the same, mentioned topology (2.7). 
In particular, from the MAOS momenta of the WW system, kf^2°^'^^ , one can construct 
the invariant mass of the top quark as 

{mf:r)' = {Pb,+Pe,i + fc— '^^)' , (2.12) 

whereas from the MAOS momenta of the ti system, ^ one can construct the W 

boson invariant mass as 

(rnw^r)' = (p^,. + A;r°"**y ■ (2.13) 

^For the dileptonic tt decay, one of the presently most popular techniques is the neutrino weighting 
method, which has been used for the top quark mass measurement [36, 37] and for studying spin correlations 
in tt production [38] . 



-8- 



LHC, /s = 7 TeV 



LHC, \fs = 7TeV 




Figure 4. The m'^'^°^ (left panel) and m^°^ (right panel) distributions for the right and for the 
wrong final-state partition. The distributions include only events with real MAOS solutions for all 
partitions (see text for details). 



It is clear that, if the right partition is selected and in the limit where the MAOS momenta 
equal the true momenta in each decay chain, eqs. (2.12) and (2.13) should equal respectively 
the true rrit and mw values, up to finite decay widths. In practice, the MAOS momenta 
differ in general from the true momenta. However, the MAOS invariant mass distribution 
exhibits a peak structure around the true parent particle mass. Hence — to the extent 
that the spread around the peak value is not too large — the above eqs. (2.12) and (2.13) 
are still useful for our purpose, which is, we recall, not to measure the t or W mass, but 
to single out the correct final-state partition. We show in fig. 4 the m™^°^ (left panel) 
and the m^^^ (right panel) distributions calculated using the right partition (black solid 
histogram) and the wrong partition (red dashed histogram). 

The argument below eqs. (2.12)-(2.13) may be used to construct a test variable of 
correct partition by noting that 

\mY;r - m\{Pw) > \m'f;r - mliPn) , (2.14) 

and a similar inequality holds also for m'^°^. In reality, the notation in relation (2.14) 
is incomplete, because of the mentioned discrete ambiguity in the determination of the 
longitudinal components of the invisible momenta. This is a common problem for methods 
attempting to reconstruct invisible momenta, and based on polynomial on-shell equations 
of degree higher than one. One cannot distinguish the solutions among themselves, even 
though only one solution corresponds to the true one and the others are redundant. 

We construct our test variable treating these solutions in a symmetric way. We first 
define a quantity 

Aimr°^(a) = mr°^(p,,p,, (« = +,-), (2.15) 
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where we omitted the chain label, and a denotes the two MAOS solutions for the invisible 
momentum k™^^ in that decay chain. Thence we found that 

^ \Atmf:r{a)\iPw) > \^tmf:ri»)\iPR) ■■ 84.2% , (2.16) 

i=l,2;a=+,- i=l,2;a=+,- 

namely that the sum over chains and over solutions of the quantity defined in (2.15) is 
smaller when calculated on the right partition than on the wrong partition in as much as 
84% of the events.^ Consistently with the discussion in sec. 2.1, we then define our test 
variable as follows: 

UP,)^ J] |Aim--^(a)|(P,) • (2.17) 

1=1,2; «=+,- 

The whole line of reasoning below eq. (2.14) can be applied to the WW subsystem as 
well. Similarly as mf^^°^ in (2.15) one constructs the difference between m^^°^ and niw 
and symmetrizes between decay chains and solutions. We obtained 

^ \Awm'^y'{a)\{Pw) > Yl \Awm^y'{a)\{PR): 86.8%, (2.18) 

2=1,2; o=+,- 1=1,2; a=+,- 

and we define our test variable T4 as 

UPk)^ Y \Awm'^y'{a)\{Pk) . (2.19) 

i=l,2; a=+~ 

An important difference between the m™'^°'^-based method and the m^y'°^-based one 
should be emphasized. Within the mf^^'^^ method, the choice of final-state partition enters 
only when constructing the top invariant mass (see eq. (2.12)), whereas only one partition 
is possible in the estimate of kf^^"^-'^^ _ Conversely, within the m^^°^ method, the choice 
of final-state partition does come into play in the calculation of kf^^°^'^^ (see eq. (2.13)), 
namely when calculating Mt2 of the ti system. The important point is that wrong parti- 
tions sometimes result in complex solutions for (the longitudinal components of) kf^^°^'^^ 
much more often than right partitions do,^ and this can be used as a further criterion for 
identifying the right partition. From our simulation, we found that: 

(a) events with at least one complex solution (in any of the partitions) occur 38.5% of 
the time; 

(b) most importantly, events where a complex solution appears only in the wrong parti- 
tion occur 37.9% of the time - very close to the percentage in point (a).^ 



^We mention that we have tried a number of alternative combinations besides X^j^^ 2 a=H I ' I- 

particular _ I ■ li Tli^i 2 q=+ - I ' l^i niiiia,/3=+,- 1 • | and maxa,/3=+,-| • |. The combination proposed 

in eq. (2.16) is the one found to have the best efhciency. 

®Right partitions can also yield complex solutions, but only because of occasional failure of the numerical 
minimization in the Mt2 calculation, that we find to be the case only for a very small subset of all events 
(about 0.6%). We used a modified version of the Mt2 : :Basic_Mt2_332_Calculator algorithm of the Oxbridge 
MT2 / Stransverse Mass Library [39]. 

^Besides, we found: (i) events where a complex solution appears in both the right and the wrong partition 
(0.4% of the total events). Of course for this kind of events the information from m^!J?°° is unusable; (ii) 
events where a complex solution appears only in the right partition (0.2% of the total events). Events (i) 
and (ii) explain the difference between the percentages in points (a) and (b). 
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LHC, /s = 7 TeV 



LHC, \fs = 7TeV 




Figure 5. The distributions of AT3{Pw, Pr) (left panel) and AT4{Pw, Pr) (right panel), with 
Pw and Pr denoting the wrong and the right partition. The legend indicates the percentage of 
correctly partitioned events, before and after inclusion of the cut Af"(0) > 400 GeV, to be described 
in sec. 3.2. The efficiency figures in the right panel take into account the complex-MAOS-solution 
criterion, whereas histograms report only the events free of complex solutions. See text for full 
details. 



Prom the above points, it is clear that the ruy^^ method can be augmented by the following 
requirement: 

if the MA OS calculation returns at least a complex solution, hut only for one of the parti- 
tions, then this partition should he regarded as the wrong one. 

In the following we will refer to this statement as the complex-MAOS-solution criterion. 
This requirement has a probability of mistakenly discarding a correctly paired event as 
small as 5 per mil, as can easily be deduced from the above items. Note that this criterion 
is already taken into account in the 86.8% efficiency figure reported in eq. (2.18) and should 
be regarded as part and parcel of the AT4 method. 

The whole discussion between eq. (2.16) and the previous paragraph is illustrated in 
the plots of fig. 5, that report the distributions for AT3(PvK, -Pr) and AT4(Piy, Pr) (we 
recall the reader that AT is defined in eq. (2.3)). In particular, the efficiency figures in eqs. 
(2.16) and (2.18) correspond to the solid-line histograms, namely those before the M^*(0) 
cut to be introduced in sec. 3.2 (see figure legend). 

A careful reader may have noted that, in the AT4 panel, the percentage of events 
in the positive semiaxis looks, and in fact is, smaller that the corresponding efficiency 
(percentage of correctly partitioned events) indicated in the legend. This is especially true 
for the histogram after the Mf*(0) cut. The reason is the fact that the T4-method efficiency 
is calculated as the sum of the AT4 > criterion and, when applicable, of the complex- 
MAOS-solution criterion, that by itself belongs to the r4-method, as already mentioned. 
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Schematically, the total T4-method efficiency, €74, splits up as follows: 

eT4 = C-ec + R-en, (2.20) 

where C is the percentage of events with complex solutions, R is the percentage of events 
free of complex solutions {C + R =100%), ec denotes the efficiency of the complex-MAOS- 
solutions method and en the efficiency of the AT4 > method. For the events passing the 
M^{0) cut, we found R = 31% and cr ~ 90%. The R subset is the only one displayed in 
the dashed histogram on the right panel of fig. 5, and the efficiency e/j is the percentage 
of this histogram lying in the positive semiaxis. More importantly for the overall method 
efficiency er^ in eq. (2.20) is to note that C = 1 — R = 69% and that ec — 99%, to 
be compared with C = 38.5% (and ec basically the same) before the cut. These figures 
demonstrate that, after the cut, wrongly partitioned events tend to display complex MAOS 
solutions way more often than before the cut. This behavior is actually expected: if the T4 
variable, constructed with wrongly partitioned kinematics, may produce complex solutions, 
they will tend to proliferate if the kinematics gets more boosted, as is the case after the 
cut. The bottom line is that, from the point of view of the overall r4-method efficiency, an 
increase in the number of events with complex solutions C is an advantage, because the 
complex-MAOS-solution criterion has by itself an efficiency ec of nearly 100% of picking 
up the correct partition. 

To conclude this section, we note explicitly that the previous discussion refers to events 
at parton level, as assumed throughout this paper. Of course, effects such as momentum 
smearing may well destabilize the above conclusions. However, it seems reasonable to 
believe that the main effect of smearing will not be to have complex solutions migrate from 
the wrong to the right partition, but rather to increase the number of events where both the 
right and wrong partitions consist of only complex solutions. Such events represented only 
0.6% of our total events (see point (b) above), and we simply did not use the variable for 
such events. It is possible that this fraction of events increases, maybe even substantially, 
when including mismeasurement effects. Rather than simply disposing of the T4 variable 
for these events, one may instead consider introducing some more general complex-solution 
criterion than the one advocated in this section. One such criterion may be to choose as 
the true partition the one whose MAOS solution has, for instance, the largest real part. It 
is worth mentioning that the issue of complex solutions in event reconstruction has been 
recently discussed in detail in ref. [40], and the argument in the present paragraph is 
inspired by the main messages in that paper. We leave the question of the utility and 
efficiency of our thus-reformulated complex-MAOS-solution criterion to forthcoming work 
[14] where smearing effects will be taken into account. 

3 Improving the method efficiency 

3.1 Correlations and combinations of the test variables 

In the previous section we have shown that each of the introduced test variables Tj is able 
to find the correct final-state partition in > 80% of the total events that pass the basic 
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selection cuts. Already at the end of section 2.1 we have emphasized that the overall method 
efficiency may be systematically improved by combining the Tj and/or by introducing 
appropriate kinematic cuts. We will now address these two possibilities in turn. 

First of all, the possibility that combining the Tj does indeed allow to increase the 
overall method efficiency relies on the Tj being as weakly correlated as possible. Hence, 
we should first address the question of how large these correlations are. For the reader's 
convenience, we recall that our Tj are defined in eqs. (2.6), (2.10), (2.17) and (2.19) and 
that we also introduced a complex- MAOS- solution criterion at the end of sec. 2.3.3. Two- 
dimensional plots of the ATi{P\Y, Pr) — ATj{Pw, Pr) correlations are shown in fig. 6. 
These plots do not include events with complex solutions. 

The following comments are in order. First, it is apparent that Ti and T2 have a quite 
strong correlation. This is because the mbi value is directly used in the calculation of Mt2, 
that is a monotonically increasing function of mhe- Further, the MAOS-related variables, 
T3 and T4, are expected to have some degree of correlation with respectively Ti (related 
to mi,,) or T2 (related to M^g)- Arguments in support of this statement go as follows. 
Concerning the — Ti correlation, mf^^^ was defined as 

[m^ ) — m^,, -\- 2 \^ [mbe) + \Pbe\ \i^ I - Pbf • k 

whence the dependence on mbi is apparent. With regards to the T4 — T2 correlation, recall 
instead that the mf,£ value was used when calculating /c™^os-tt^ turn necessary for m^y"^. 
However, in practice, these correlations turn out to be rather weak, and this occurs because 
of the nontrivial structure of the MAOS solutions. For example, if is calculated from 
a balanced configuration, rnf^^°^ can also be written as 



^^maos)2 = + 2 i^^J (mbe)^ + |PT,fe£p|kr| cosh(?7b£ - r]^) - pT,bi ■ kr 

= {M^%y + 2^{mMy + \pTM\^\kT\ [cosh(%^ - r,,) - 1] , (3.1) 

for given values. Therefore, we can regard each of {T3,r4} vs. each of {Ti,T2} as 
approximately uncorrelated. 

In principle, one can merge the information from the various Tj even in presence 
of correlations among them, e.g. by constructing a likelihood function (not to mention 
more sophisticated pattern-finding techniques such as neural networks). The exploration 
of optimal ways of using the combined information from all the Tj is of course beyond the 
scope of the present paper. Besides, it should be emphasized that each of the introduced 
variables can actually be defined in a number of variations. One may adopt the approach 
of considering all these variations and using their information in a joint way, that properly 
takes into account correlations. This approach would also offer a way to improve the 
method efficiency. We refrain from entertaining such approach in this paper, which is 
devoted to the discussion of the main ideas behind each variable. 

Here we limit ourselves to show that even a very simple combination of the test vari- 
ables results in an appreciable improvement of the overall method efficiency. We choose 
{T2, T3, T4} as a set of independent test variables. Then, the arguably simplest algorithm 
to find the right partition by using these variables in a combined test is as follows: 
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Figure 6. Plots of the correlations between ATi{Pw, Pb.) and ATj{Pw, Pr)- The color code refers 
to the number of events in each plot pixel. 

0. Calculate T2, T3 and T4 for the two possible partitions: Pi and P2; 

1. If Ti(Pj) > m^^^ or T2(Pj) > mj, regard Pj as the wrong partition; 

2. If T/i{Pi) results in complex MAOS solutions, whereas no complex solutions are 
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present for the other partition, regard Pi as the wrong partition; 



3. If none of tlie above criteria is met, then choose Pi{P2) as the right partition if the 
majority of ATi(P2, Pi) > (< 0).*^ 

Point 1 corresponds to the already mentioned, and well-known, observation that mf,£ and 
M|,2 have a physical upper bound at m^^^ and mt respectively, if the partition is the 
correct one. Point 2 is the complex-MAOS-solution criterion enunciated at the end of sec. 
2.3.3. As also discussed there, this method selects the right partition for 98.4% of the events 
with at least one complex solution. According to our simulation, the combined algorithm 
0-3 described above returns the right partition for 89% of the full data set, which we find 
an already remarkable improvement over the single- variable efficiency. 

3.2 Improving by cuts on kinematic variables 

Up to this point, we have not introduced any event selection cuts, except for the basic cuts 
described in sec. 2.2, that, we checked, have only a marginal impact on the Tj efficiencies. 
Although the previous section shows that a method efficiency of 90% or even larger is 
arguably obtainable by just a wise combination of the Tj, we would like to show in this 
section that improvements are possible also by imposing appropriate kinematic variable 
cuts. This of course comes at the price of losing (some) event statistics. 

First, we note that the property of the test variables of being larger when evaluated 
with the wrong partition becomes more marked as the final states are more energetic. 
Intuitively, the larger the magnitude of the input kinematics, the more spread becomes the 
distribution of the variable, if it is calculated with the wrong partition. On the other hand, 
the distribution calculated with the right partition enjoys the same physical endpoints 
independently, of course, of the input kinematics. So, for a more boosted subset of events, 
one expects ATi{Pw, Pr) to be larger than zero more often. To single out such event subset, 
one may enforce cuts on appropriate kinematic variables, and parametrically improve the 
method efficiency as the cut gets stronger. 

For the dileptonic tt process, a well-known set of 'partition-insensitive' kinematic vari- 
ables - which namely treat all the visible particles on an equal footing, hence they are free 
of combinatorial ambiguities - is represented by: 

• M**(0), 

• m'^ = {pb^ +pb2 +pi^ +^'<?2)^ 

• Ht = J2iPiT, where i = h, 62, h, £2, 

• /If** 



*A more refined approach along the hnes of point 3 is to construct a function of the Ti, e.g., Yli Ti or a 
hnoar combination of T^'s, and choose one partition using the criterion f{Ti){Pw) > f{Ti){Pji). We found 
this kind of approach to also improve efficiency. The likelihood function is just one of such functions. 
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Figure 7. (Left panel) Combined method efficiency vs. loss in statistics with the inclusion of an 
inclusive cut. The x-axis represents the efficiency of the combined method described in sec. 3.f 
whereas the y-axis reports the number of events that passed one of the inclusive cuts, listed with 
different line/color codes in the legend. Events are normalized to the event subset that passed the 
basic selection cuts discussed in sec. 2.2. (Right panel) Same as the left panel, but for the T2 
variable only, rather than the combined method, on the x-axis. 



where M^*(0) is the transverse mass of the full tt system with rriui/ = [34, 41, 42], 

(m^*(0))^ = ml + 2 U\pT\^ + ml\^T\ - pr • with p = Pi , (3.2) 

and M^Q^^ is the smallest value of obtained over all possible partitions [25] . One can 
impose these cuts in addition to the basic event selection cuts of sec. 2.2. In the left panel 
of fig. 7, we show the flow of method efficiency vs. loss in statistics as one makes the cuts 
harder. See figure caption for full details. 

The figure shows that the most effective cut variable is My(0). One can reach 90% 
method efficiency (i.e. get the right partition 90% of the times) with just a 4% loss in event 
statistics. The efficiency rises to 99% when roughly halving the statistics. For the sake of 
completeness, we also mention that, for the quoted efficiencies of 90%, 95% and 99%, the 
cut value is respectively Mf*(0) > 277, 343 and 404 GeV. 

Because, for some event topologies to be described in the next section, the MAOS- 
based variables T3 and T4 are not usable, but the M7^2-based variable T2 still is, we also 
show in the right panel of fig. 7 a plot of efficiency gain vs. statistics loss for the T2 variable 
alone. 
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Figure 8. The decay process of eq. (4.1). The momenta pi and p[ are assmiied to be measurable, 
while ki give rise to missing transverse momentum. 

4 Generalizations of the method 

4.1 Case of chain-assignment as well as ordering ambiguities 

From the discussion so far, our method of finding the right partition can be promptly 
generalized, in particular to topologies relevant for new physics scenarios. Among the 
most popular such event topologies is the one in eq. (2.7), that is, pair-production of 
heavy particles, decaying into a set of visible particles plus a pair of invisible particles 
(possibly, dark- matter candidates). We here sketch this generalization, leaving full details 
to future work [14]. For the sake of discussion, we here focus on identical decay chains, 
but the argument presented below can be extended also to the case of non-identical decay 
chains. 

The first observation to make is that the decay of heavy parent particles can be in 
one step or multiple steps, depending on the mass spectrum and couplings of the new 
particles. The Ti and T2 variables of sees. 2.3.1 and 2.3.2 can be defined, and applied to 
find the right partition of the visible particles, even in the case of a one-step decay, e.g., 
gluino-pair production, followed by the three-body decay g — t- qq^ in i?-parity conserving 
supersymmetric models. On the other hand, to utilize the MAOS-based variables and 
T4, the decay process should be at least in two steps, as e.g. the dileptonic tt process is. 
For instance, if niq < nig, gluino-pair production will be dominantly followed by two-step 
decays, i.e., g ^ qq ^ QQXi- In terms of event topology, the cascade decay processes of 
interest to this discussion can be generalized as 

YY ^ Vi{pi)Xi + VM)X[ ^ Vi{pi)V2{p2)X2 + Vl{p[)VM)X!2 

^ . . . ^ Vi(pi) • • • vMxih) + v;{p[) ■ ■ ■ V^ip'Jx {k2) , (4.1) 
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for my > rnxi > "^^2 > • • • > "t-x ^^'^ "^y > ""^xj > "t-x^ > • • • > . This decay is also 
sketched in fig. 4.1. The rest of the present discussion will also assume = 'mx'i albeit 
this requirement is renounceable with appropriate modifications of the method. 

The second observation is that, in the context of this general event topology, there is, 
when partitioning the visible particles, an additional type of combinatorial ambiguity with 
respect to dileptonic ti. It is the ordering ambiguity, corresponding to the fact that Va'^ is 
produced before vj^^ (a = 1, • • • , n — 1). Of course, there is still, as in dileptonic ti, the 
combinatorial ambiguity of pairing the visible particles into two sets, i.e., distinguishing 
{Va} from {Vfj'}. Then, the total number of partitions of the visible particles is 

N = (number of pairings) x (number of orderings in one chain)^ = " x (n!)^ )(4-2) 

if the visible particles are not distinguishable among each other, as is the case for jets 
(barring taggings etc.). Then, one can calculate the MAOS invariant masses of 2n parent 
particles {Y, Xi, ■ ■ ■ , Xn-i, Y , X[, ■ ■ ■ , X'^_i), such that n MAOS-based test variables 
are defined. We will label them as T^^"'^ (a = 0, • • • , n — 1), for a given partition Pj 
(j = 1, ■ ■ ■ , N) and MAOS momenta k"^^^ (we recall that the MAOS determination of 
the invisible momenta consists in general of several solutions, collectively indicated as 
^maos^^ Thence, one can define: 

Tr°^(P„ = E |Ax„m^r(p,„ k^)\{P,) , (4.3) 

c=l,2; a=+,- 

where 

AxmZ°'{Pac, K) = rn^TiPac. K) " mx^ (4-4) 

with Pal = Yl^j=a+iPj-> Pa2 = ^]=a+iP'j^ ^0 = Y- These correspond to the T3 and 
T4 variables discussed in sec. 2.3.3. Actually, each T^^°^{Pj, k^^'^'^) can be obtained in 
different ways, since the MAOS momenta k"^^^ can be calculated from „C2 subsystems. 
In order to account for them in a symmetric way, one may define T^'^°^ as 

Each T^^°^ variable can then be used to decide on the right partition: this partition is 
likely to have the smallest r™^°^ value than all the other partitions, irrespective of whether 
combinatorial ambiguities concern the pairing or the ordering of the visible particle sets. 

The Ti and T2 variables are also usable because one can calculate several subsystem 
Ti's and T2's. For example, in the event topology (4.1), the subsystem T2, or briefiy Tl^*^, 
can be defined as T^'^, T2^'^^ , • • • , T2 " ^ such that T^-^ < my, T2^'^^ < mxi, 
■ ■ ■ , T2" < mXn-1 for the right partition. By sequentially testing the partitions in 

the order YY — t- • • • — )■ Xn-iX'^_^ systems, at each step selecting the partition with the 
smallest T™*^, one can find the most likely partition for the full system. 

We finally observe that the information from the above test variables can be merged 
into a combined method in various ways. The simplest algorithm may be 
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Calculate all the test variables as explained above. If T, " " > ..1/ = m 



a given partition Pj than in any other partition, one should regard Pj as the wrong 



(a) If the above procedure fails to pick up one partition, one may select the right partition 
Pj from the requirement that it gives the smallest value of Tj (with respect to the 
other partitions) for the largest number of test variables. 

We again emphasize that the efficiency of each Tj and of the combined method can be fur- 
ther improved by imposing cuts on 'partition-insensitive' variables, as discussed in sec. 3.2. 

Concrete examples of this generalized method will be the gluino pair-production and 
decay discussed above, and the pair- production of the second lightest neutralino X2) which 
decays to i^£~x^. We will consider in more details these new physics processes for certain 
benchmark points in future work [14]. 

4.2 Comments about the method's dependence on mass information 

We would like to conclude this section about generalizations with a few remarks about the 
question of the method's dependence on the knowledge of the mass of the pair-produced 
states, my, or that of the invisible daughters, rriy^.^ As a first remark, we emphasize again 
that, of our proposed T,, the variables Ti as well as T2 do not use the information on 
my at all. On the other hand, this information appears necessary for constructing the 
MAOS-based variables T3 and T/j^, at least according to the way they were defined in sec. 
2.3.3. In order to be able to use the variables T^^^ at all, one would therefore seem to need 
my and m^ to have been measured already. 

It is worth observing that, in many scenarios of new physics, the mass measurements 
in question are not expected to be affected by combinatorial ambiguities. This holds in 
particular in the case of the topologies of interest for our method, where one can construct 
the event variable Mt2- In fact, if, in absence of combinatorial ambiguities, my may be 
estimated from the endpoint of Mt2, then, in presence of combinatorial ambiguities, the 
same estimate can be performed using M^Gen [25] instead, i.e. the smallest Mt2 value for 
all possible partitions, which enjoys the same endpoint position as M'r2 constructed with 
the right partition. ^'^ In cases where the production cross-section is so small, or background 
contamination so high, that the mass information cannot practically be extracted along 
the above lines, one can consider using a modified version of our MAOS-based variables, 
wherein the parametric dependence on the true value of my is disposed of. To make an 
example, in place of the T3 definition, given in eq. (2.17), one may instead use the quantity 



which should be close to zero if the partition is the correct one, whereas it can be much 
larger if the partition is wrong. Hence, again, AT3{P]y,Pfi) is expected to prefer positive 

®We thank the Referee for triggering this discussion. 

^"The use of Mtgcu was also invoked in sec. 3.2 as a partition-insensitive cut variable. 



one. 




(4.6) 
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values. We note explicitly that the notation in eq. (4.6) is, similarly as in eq. (2.14), 
incomplete, in that the MAOS solution obtained from the on-shell relations (2.11) is not 
unique (see discussion below those relations). On the other hand, on the r.h.s. of eq. (2.11), 
one does not necessarily have to choose the parent particle mass. One has in general the 
following choices [34] 

• MAOSl: my [9] 

• MA0S2: Mt2 = min. 4 max{ M^^^M^^H 

(i) 

• MA0S3: Mj. , with i = 1,2 labeling the decay chain. 

The advantage of MA0S3 is that it allows to have a unique MAOS solution for each event, 
thereby allowing use of the variable as defined in eq. (4.6). In fact, we found the 
performance of AT^ comparable to that of AT^, as defined in sec. 2.3.3. The reason why 
we defined our MAOS-based variables according to the MAOSl scheme is mostly simplicity. 

The other unknown mass besides my is m^, upon which all the Tj variables bear 
dependence. While the necessity to input some value seems inescapable, we are actually 
quite confident that our method would perform rather well even in the case where the 
mass is yet to be measured at the moment of having to solve the combinatorial problem. 
We make the following considerations in support of this statement. First, as already 
mentioned in the general Mt2 discussion in sec. 2.3.2, an estimate, perhaps an accurate 
one, of may be obtained from the Mt2 kink [17-20], which however requires sufficient 
statistics. Even in absence of such statistics, and in presence of combinatorial ambiguities, 
the uppermost part of the MTGeni^x) P^o^ "^x plane may provide a rough 

idea of the mass. It is worth emphasizing that a rough input value for may well be 
sufficient in many scenarios. It is so at least in the case where <C my: it can be shown 
in fact [43] that, in this limit, MAOS solutions have corrections going as m^/my and are, 
therefore, largely insensitive to the specific value chosen for m^, so long as the mentioned 
mass hierarchy holds at least approximately. 

Clearly, all the issues mentioned in this section deserve further investigation, which is 
best carried out within specific new-physics benchmark points. This lies outside the scope 
of the present paper, mainly meant to propose our method and to apply it to the simplest 
yet relevant SM example we could think of, namely tt production. We will however return 
to all these issues in a forthcoming paper [14]. 

5 Conclusions and Outlook 

In this paper, we have proposed a novel method to resolve combinatorial ambiguities in 
hadron collider events involving two invisible particles in the final state. This method 
is based on the kinematic variable Mt2 and on the MAOS reconstruction of invisible 
momenta, that are reformulated as test variables Ti , namely testing the correct combination 
against the incorrect ones. The efficiency of each single Tj in the determination of the 
correct combination can then be systematically improved in two directions: by combining 
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the information from the different Tj and by introducing further cuts on suitable, partition- 
insensitive, variables. In this sense, our method is completely scalable. 

All the above program is illustrated in the specific, and per se interesting example of 
top anti-top production, followed by a leptonic decay of the W on both sides. We however 
emphasize that, by construction, our method is also directly applicable to many topologies 
of interest for new physics, in particular events producing a pair of undetected particles, 
that are potential dark-matter candidates. 

Our method opens a whole spectrum of generalizations and by-product issues, on which 
work is in progress [14]. The most urgent include: 

• Addressing whether and how the method is affected by the inclusion of effects such 
as QCD radiation, hadronization and signal degradation due to detector effects. The 
main challenge of this problem is that, after hadronization, the assignment of a 
'parton' to a given decay chain or to a certain position within the decay chain is no 
more well-defined. As such, it requires separate thought. 

• Generalizing the method to the combinatorial uncertainty due to the ordering of the 
visible particles in the decay chain, uncertainty which is not present in dileptonic tt. 
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